    Run Generation Revisited: What Goes Up May or May Not Come Down

    In this paper, we revisit the classic problem of run generation. Run generation is the first phase of external-memory sorting, where the objective is to scan through the data, reorder elements using a small buffer of size M, and output runs (contiguously sorted chunks of elements) that are as long as possible. We develop algorithms for minimizing the total number of runs (or equivalently, maximizing the average run length) when the runs are allowed to be sorted or reverse sorted. We study the problem in the online setting, both with and without resource augmentation, and in the offline setting. (1) We analyze alternating-up-down replacement selection (runs alternate between sorted and reverse sorted), which was studied by Knuth as far back as 1963. We show that this simple policy is asymptotically optimal. Specifically, we show that alternating-up-down replacement selection is 2-competitive and no deterministic online algorithm can perform better. (2) We give online algorithms having smaller competitive ratios with resource augmentation. Specifically, we exhibit a deterministic algorithm that, when given a buffer of size 4M, is able to match or beat any optimal algorithm having a buffer of size M. Furthermore, we present a randomized online algorithm which is 7/4-competitive when given a buffer twice that of the optimal. (3) We demonstrate that performance can also be improved with a small amount of foresight. We give an algorithm, which is 3/2-competitive, with foreknowledge of the next 3M elements of the input stream. For the extreme case where all future elements are known, we design a PTAS for computing the optimal strategy a run generation algorithm must follow. (4) Finally, we present algorithms tailored for nearly sorted inputs which are guaranteed to have optimal solutions with sufficiently long runs
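
    The alternating-up-down policy described in item (1) can be illustrated with a minimal sketch. This is not the paper's code: the function name `alternating_up_down_runs`, the in-memory list of runs, and the restriction to numeric keys are assumptions made for illustration. The buffer is a heap of at most `m` elements; an incoming element joins the current run if it can still extend it, and is otherwise held back for the next run, whose direction is reversed.

```python
import heapq
from itertools import islice

_END = object()  # sentinel marking an exhausted input stream


def alternating_up_down_runs(stream, m):
    """Split a numeric stream into runs with a buffer of at most m elements.

    Runs alternate between ascending (even index) and descending (odd index).
    Illustrative sketch only; names and return type are assumptions.
    """
    it = iter(stream)
    sign = 1                       # +1: ascending run, -1: descending run
    pending = list(islice(it, m))  # elements held for the next run
    runs = []
    while pending:
        # Keys are sign * value, so the heap minimum is always the next output.
        heap = [sign * x for x in pending]
        heapq.heapify(heap)
        pending = []
        run = []
        while heap:
            key = heapq.heappop(heap)
            run.append(sign * key)
            nxt = next(it, _END)
            if nxt is _END:
                continue
            if sign * nxt >= key:
                heapq.heappush(heap, sign * nxt)  # can still extend this run
            else:
                pending.append(nxt)               # must wait for the next run
        runs.append(run)
        sign = -sign                              # reverse direction
    return runs


runs = alternating_up_down_runs([1, 2, 3, 6, 5, 4], 1)
```

    With a buffer of one element, the input above splits into an ascending run followed by a descending run, so the direction change lets the second half of the input be absorbed into a single run.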

    Association of neurexin 3 polymorphisms with smoking behavior.

    The Neurexin 3 gene (NRXN3) has been associated with dependence on various addictive substances, as well as with the degree of smoking in schizophrenic patients and impulsivity among tobacco abusers. To further evaluate the role of NRXN3 in nicotine addiction, we analyzed single nucleotide polymorphisms (SNPs) and a copy number variant (CNV) within the NRXN3 genomic region. An initial study was carried out on 157 smokers and 595 controls, all of Spanish Caucasian origin. Nicotine dependence was assessed using the Fagerstrom index and the number of cigarettes smoked per day. The 45 NRXN3 SNPs genotyped included all the SNPs previously associated with disease, and a previously described deletion within NRXN3. This analysis was replicated in 276 additional independent smokers and 568 controls. Case-control association analyses were performed at the allele, genotype and haplotype levels. Allelic and genotypic association tests showed that three NRXN3 SNPs were associated with a lower risk of being a smoker. The haplotype analysis showed that one block of 16 Kb, consisting of two of the significant SNPs (rs221473 and rs221497), was also associated with lower risk of being a smoker in both the discovery and the replication cohorts, reaching a higher level of significance when the whole sample was considered [odds ratio = 0.57 (0.42-0.77), permuted P = 0.0075]. By contrast, the NRXN3 CNV was not associated with smoking behavior. Taken together, our results confirm a role for NRXN3 in susceptibility to smoking behavior, and strongly implicate this gene in genetic vulnerability to addictive behaviors

    X-chromosome tiling path array detection of copy number variants in patients with chromosome X-linked mental retardation

    Contains 3 additional files with supplementary information. [Background] Approximately 5–10% of cases of mental retardation in males are due to copy number variations (CNV) on the X chromosome. Novel technologies, such as array comparative genomic hybridization (aCGH), may help to uncover cryptic rearrangements in X-linked mental retardation (XLMR) patients. We have constructed an X-chromosome tiling path array using bacterial artificial chromosomes (BACs) and validated it using samples with cytogenetically defined copy number changes. We have studied 54 patients with idiopathic mental retardation and 20 control subjects. [Results] Known genomic aberrations were reliably detected on the array and eight novel submicroscopic imbalances, likely causative for the mental retardation (MR) phenotype, were detected. Putatively pathogenic rearrangements included three deletions and five duplications (ranging from 82 kb to 1 Mb), all but two affecting genes previously known to be responsible for XLMR. Additionally, we describe different CNV regions with significantly different frequencies in XLMR and control subjects (44% vs. 20%). [Conclusion] This tiling path array of the human X chromosome has proven successful for the detection and characterization of known rearrangements and novel CNVs in XLMR patients. The authors thank the "Genoma España" and Genome Canada joint R+D+I projects in human health, plants and aquiculture; the former "Departament d'Universitats i Societat de la Informació" (DURSI) and the "Departament de Salut", from the Catalan Autonomous Government (2005SGR00008 - Generalitat de Catalunya); the Instituto de Salud Carlos III (PI041126, CIBER-ESP), the EU's Sixth Framework Programme [FP6-2005-LIFESCIHEALTH-7; ANEUPLOIDY No. 037627] and Fundación Areces (U-2006-FARECES-O). Peer reviewed

    A new approach for identifying non-pathogenic mutations. An analysis of the cystic fibrosis transmembrane regulator gene in normal individuals

    Given q as the global frequency of the alleles causing a disease, any allele with a frequency higher than q minus the cumulative frequency of the previously known disease-causing mutations (threshold) cannot be the cause of that disease. This principle was applied to the analysis of cystic fibrosis transmembrane conductance regulator (CFTR) mutations in order to decide whether they are the cause of cystic fibrosis. A total of 191 DNA samples from random individuals from Italy, France, and Spain were investigated by DGGE (denaturing gradient gel electrophoresis) analysis of all the coding and proximal non-coding regions of the gene. The mutations detected by DGGE were identified by sequencing. The sample size was sufficient to select essentially all mutations with a frequency of at least 0.01. A total of 46 mutations was detected, 20 of which were missense mutations. Four new mutations were identified: 1341+28 C/T, 2082 C/T, L1096R, and I1131V. Thirteen mutations (125 G/C, 875+40 A/G, TTGAn, IVS8-6 5T, IVS8-6 9T, 1525-61 A/G, M470V, 2693 T/G, 3061-65 C/A, 4002 A/G, 4521 G/A, IVS8 TG10, IVS8 TG12) were classified as non-CF-causing alleles on the basis of their frequency. The remaining mutations have a cumulative frequency far exceeding q; therefore, most of them cannot be CF-causing mutations. This is the first random survey capable of detecting all the polymorphisms of the coding sequence of a gene
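
    The threshold rule and the sample-size remark in this abstract can be sketched numerically. The function names and the values of `q`, `known` and `candidate` below are hypothetical and chosen only for illustration; they are not figures from the study. The detection calculation additionally assumes two independently sampled chromosomes per individual, which the abstract does not state.

```python
def non_causal_threshold(q, known_causal_freqs):
    """Frequency above which a single allele cannot be disease-causing.

    q is the global frequency of all disease-causing alleles; the known
    causal alleles already account for part of it.
    """
    return q - sum(known_causal_freqs)


def detection_probability(allele_freq, n_individuals, ploidy=2):
    """Chance of observing an allele at least once in the sample.

    Assumes chromosomes are sampled independently (an assumption of this sketch).
    """
    n_chromosomes = ploidy * n_individuals
    return 1.0 - (1.0 - allele_freq) ** n_chromosomes


# Hypothetical illustrative values, NOT taken from the study.
q = 0.02
known = [0.012, 0.003]
threshold = non_causal_threshold(q, known)  # about 0.005
candidate = 0.01
is_excluded = candidate > threshold         # too common to be causal

# 191 individuals is the sample size given in the abstract.
p_detect = detection_probability(0.01, 191)
```

    Under these assumptions, an allele with frequency 0.01 is seen at least once in 191 individuals with probability of roughly 0.98, which is consistent with the abstract's statement that the sample could pick up essentially all mutations at that frequency.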

    High frequency of the IVS2-2A>G DNA sequence variation in SLC26A5, encoding the cochlear motor protein prestin, precludes its involvement in hereditary hearing loss

    BACKGROUND: Cochlear outer hair cells change their length in response to variations in membrane potential. This capability, called electromotility, is believed to enable the sensitivity and frequency selectivity of the mammalian cochlea. Prestin is a transmembrane protein required for electromotility. Homozygous prestin knockout mice are profoundly hearing impaired. In humans, a single nucleotide change in SLC26A5, encoding prestin, has been reported in association with hearing loss. This DNA sequence variation, IVS2-2A>G, occurs in the exon 3 splice acceptor site and is expected to abolish splicing of exon 3. METHODS: To further explore the relationship between hearing loss and the IVS2-2A>G transition, and assess allele frequency, genomic DNA from hearing impaired and control subjects was analyzed by DNA sequencing. SLC26A5 genomic DNA sequences from human, chimp, rat, mouse, zebrafish and fruit fly were aligned and compared for evolutionary conservation of the exon 3 splice acceptor site. Alternative splice acceptor sites within intron 2 of human SLC26A5 were sought using a splice site prediction program from the Berkeley Drosophila Genome Project. RESULTS: The IVS2-2A>G variant was found in a heterozygous state in 4 of 74 hearing impaired subjects of Hispanic, Caucasian or uncertain ethnicity and 4 of 150 Hispanic or Caucasian controls (p = 0.45). The IVS2-2A>G variant was not found in 106 subjects of Asian or African American descent. No homozygous subjects were identified (n = 330). Sequence alignment of SLC26A5 orthologs demonstrated that the A nucleotide at position IVS2-2 is invariant among several eukaryotic species. Sequence analysis also revealed five potential alternative splice acceptor sites in intron 2 of human SLC26A5. CONCLUSION: These data suggest that the IVS2-2A>G variant may not occur more frequently in hearing impaired subjects than in controls. 
    The identification of five potential alternative splice acceptor sites in intron 2 of human SLC26A5 suggests a potential mechanism by which expression of prestin might be maintained in cells carrying the SLC26A5 IVS2-2A>G DNA sequence variation. Additional studies are needed to evaluate the effect of the IVS2-2A>G transition on splicing of SLC26A5 transcripts and characterize the hearing status of individuals homozygous for the IVS2-2A>G variant
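
    The counts reported in the RESULTS (8 heterozygous carriers and no homozygotes among 330 subjects) allow a rough back-of-the-envelope check of why no homozygote was seen. The sketch below pools all 330 subjects and assumes Hardy-Weinberg proportions; both are simplifying assumptions of this example, not claims made by the authors, and the function name is hypothetical.

```python
def hardy_weinberg_summary(heterozygotes, homozygotes, n_subjects):
    """Return (variant allele frequency, expected number of homozygotes).

    Assumes a pooled sample in Hardy-Weinberg equilibrium (sketch assumption).
    """
    variant_alleles = heterozygotes + 2 * homozygotes
    q = variant_alleles / (2 * n_subjects)
    expected_homozygotes = q * q * n_subjects
    return q, expected_homozygotes


# 4 carriers among hearing-impaired subjects plus 4 among controls, n = 330 overall.
q, expected_hom = hardy_weinberg_summary(4 + 4, 0, 330)
```

    With a pooled variant allele frequency of about 1.2%, the expected number of homozygotes in 330 subjects is well below one, so their absence in this sample is unsurprising under these assumptions.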

    The human early-life exposome (HELIX): project rationale and design

    Background: Developmental periods in early life may be particularly vulnerable to impacts of environmental exposures. Human research on this topic has generally focused on single exposure–health effect relationships. The “exposome” concept encompasses the totality of exposures from conception onward, complementing the genome. Objectives: The Human Early-Life Exposome (HELIX) project is a new collaborative research project that aims to implement novel exposure assessment and biomarker methods to characterize early-life exposure to multiple environmental factors and associate these with omics biomarkers and child health outcomes, thus characterizing the “early-life exposome.” Here we describe the general design of the project. Methods: In six existing birth cohort studies in Europe, HELIX will estimate prenatal and postnatal exposure to a broad range of chemical and physical exposures. Exposure models will be developed for the full cohorts totaling 32,000 mother–child pairs, and biomarkers will be measured in a subset of 1,200 mother–child pairs. Nested repeat-sampling panel studies (n = 150) will collect data on biomarker variability, use smartphones to assess mobility and physical activity, and perform personal exposure monitoring. Omics techniques will determine molecular profiles (metabolome, proteome, transcriptome, epigenome) associated with exposures. Statistical methods for multiple exposures will provide exposure–response estimates for fetal and child growth, obesity, neurodevelopment, and respiratory outcomes. A health impact assessment exercise will evaluate risks and benefits of combined exposures. Conclusions: HELIX is one of the first attempts to describe the early-life exposome of European populations and unravel its relation to omics markers and health in childhood. As proof of concept, it will form an important first step toward the life-course exposome

    Linear, Deterministic, and Order-Invariant Initialization Methods for the K-Means Clustering Algorithm

    Over the past five decades, k-means has become the clustering algorithm of choice in many application domains primarily due to its simplicity, time/space efficiency, and invariance to the ordering of the data points. Unfortunately, the algorithm's sensitivity to the initial selection of the cluster centers remains its most serious drawback. Numerous initialization methods have been proposed to address this drawback. Many of these methods, however, have time complexity superlinear in the number of data points, which makes them impractical for large data sets. On the other hand, linear methods are often random and/or sensitive to the ordering of the data points. These methods are generally unreliable in that the quality of their results is unpredictable. Therefore, it is common practice to perform multiple runs of such methods and take the output of the run that produces the best results. Such a practice, however, greatly increases the computational requirements of the otherwise highly efficient k-means algorithm. In this chapter, we investigate the empirical performance of six linear, deterministic (non-random), and order-invariant k-means initialization methods on a large and diverse collection of data sets from the UCI Machine Learning Repository. The results demonstrate that two relatively unknown hierarchical initialization methods due to Su and Dy outperform the remaining four methods with respect to two objective effectiveness criteria. In addition, a recent method due to Erisoglu et al. performs surprisingly poorly. Comment: 21 pages, 2 figures, 5 tables, Partitional Clustering Algorithms (Springer, 2014). arXiv admin note: substantial text overlap with arXiv:1304.7465, arXiv:1209.196
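
    To make the three properties concrete (linear in the number of points, deterministic, order-invariant), here is a minimal farthest-first sketch. It is not claimed to be one of the six methods evaluated in the chapter; the function name `maximin_init`, the choice of the first center, and the tie-breaking rule are assumptions of this example. Ties are broken by point value rather than by position, and `math.fsum` is used so that the centroid does not depend on summation order. It assumes `k` does not exceed the number of distinct points.

```python
import math


def maximin_init(points, k):
    """Choose k initial centers from a list of equal-length numeric tuples.

    First center: the point farthest from the centroid. Each further center:
    the point farthest from its nearest already-chosen center. Cost is O(n*k).
    Illustrative sketch only.
    """
    n = len(points)
    dim = len(points[0])
    centroid = tuple(math.fsum(p[d] for p in points) / n for d in range(dim))

    def sq_dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))

    # Tie-break on the point itself so the result ignores input order.
    first = max(points, key=lambda p: (sq_dist(p, centroid), p))
    centers = [first]
    nearest = [sq_dist(p, first) for p in points]
    while len(centers) < k:
        idx = max(range(n), key=lambda i: (nearest[i], points[i]))
        chosen = points[idx]
        centers.append(chosen)
        nearest = [min(nearest[i], sq_dist(points[i], chosen)) for i in range(n)]
    return centers


pts = [(0, 0), (0, 1), (10, 0), (10, 1), (5, 8)]
centers = maximin_init(pts, 3)
centers_reversed = maximin_init(list(reversed(pts)), 3)
```

    Running the initializer on the same points in reversed order yields the same centers, which is the order-invariance property the abstract refers to; a random or first-k-points initializer would not have it.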

    The p.V37I Exclusive Genotype Of GJB2: A Genetic Risk-Indicator of Postnatal Permanent Childhood Hearing Impairment

    Postnatal permanent childhood hearing impairment (PCHI) is frequent (0.25%–0.99%) and difficult to detect in the early stage, which may impede the speech, language and cognitive development of affected children. Genetic tests of common variants associated with postnatal PCHI in newborns may provide an efficient way to identify those at risk. In this study, we detected a strong association of the p.V37I exclusive genotype of GJB2 with postnatal PCHI in Han Chinese (P = 1.4×10^−10; OR 62.92, 95% CI 21.27–186.12). This common genotype in East Asians was present in a substantial percentage (20%) of postnatal PCHI subjects, and its prevalence was significantly increased in normal-hearing newborns who failed at least one newborn hearing screen. Our results indicate that the p.V37I exclusive genotype of GJB2 may cause subclinical hearing impairment at birth and increases the risk of postnatal PCHI. Genetic testing of GJB2 in East Asian newborns will facilitate prompt detection of, and intervention for, postnatal PCHI

    Suicide attempts in bulimia nervosa: Personality and psychopathological correlates

    Background: Little evidence exists about suicidal acts in eating disorders and their relation with personality. We explored the prevalence of lifetime suicide attempts (SA) in women with bulimia nervosa (BN), and compared eating disorder symptoms, general psychopathology, impulsivity and personality between individuals who had and had not attempted suicide. We also determined the variables that best correlate with SA. Method: Five hundred sixty-six BN outpatients (417 BN purging, 47 BN non-purging and 102 subthreshold BN) participated in the study. Results: Lifetime prevalence of suicide attempts was 26.9%. BN subtype was not associated with lifetime SA (p = 0.36). Suicide attempters exhibited higher rates of eating symptomatology, general psychopathology, impulsive behaviors, and a more frequent history of childhood obesity and parental alcohol abuse (p < 0.004). Suicide attempters exhibited higher scores on harm avoidance and lower scores on self-directedness, reward dependence and cooperativeness (p < 0.002). The variables most strongly correlated with SA were: lower education, minimum BMI, previous eating disorder treatment, low self-directedness, and familial history of alcohol abuse (p < 0.006). Conclusion: Our results support the notion that internalizing personality traits combined with impulsivity may increase the probability of suicidal behaviors in these patients. Future research may increase our understanding of the role of suicidality to work towards rational prevention of suicide attempts

    MRPS18CP2 alleles and DEFA3 absence as putative chromosome 8p23.1 modifiers of hearing loss due to mtDNA mutation A1555G in the 12S rRNA gene

    Background: Mitochondrial DNA (mtDNA) mutations account for at least 5% of cases of postlingual, nonsyndromic hearing impairment. Among them, mutation A1555G is frequently found associated with aminoglycoside-induced and/or nonsyndromic hearing loss in families presenting with extremely variable clinical phenotypes. Biochemical and genetic data have suggested that nuclear background is the main factor involved in modulating the phenotypic expression of mutation A1555G. However, although a major nuclear modifying locus was located on chromosome 8p23.1 and despite intensive screening of the region, the gene involved has not been identified. Methods: With the aim of gaining insight into the factors that determine the phenotypic expression of the A1555G mutation, we analysed in detail different genetic and genomic elements in the 8p23.1 region (DEFA3 gene absence, the CLDN23 gene and the MRPS18CP2 pseudogene) in a group of 213 A1555G carriers. Results: Family-based association studies identified a positive association for a polymorphism in MRPS18CP2 and an overrepresentation of DEFA3 gene absence in the deaf group of A1555G carriers. Conclusion: Although none of the factors analysed seems to make a major contribution to the phenotype, our findings provide further evidence of the involvement of the 8p23.1 region as a modifying locus for the A1555G 12S rRNA gene mutation.